The Case for Building High Performance Intermediate-Level Memory Systems

نویسندگان

  • Junyi Xie
  • Gershon Kedem
چکیده

This paper presents the architecture of a high-performance intermediate-level memory subsystem. The memory subsystem is designed to narrow the growing gap between processors’ cycle time and the round-trip delay to main memory. The system is built from high-performance DRAM arrays and SRAM buffers, both integrated on the same IC. We call these parts Integrated Static and Dynamic Random Access Memory (ISDRAM). The ISDRAM system is configured as a very large cache and can be implemented either as an on-chip cache integrated with the CPU, or as a much larger external cache. We show that building very large caches can be an effective way to narrow the growing CPU-memory speed gap. Building a cache as a combination of DRAM and SRAM yields a system with a relatively low cost per bit while achieving performance that is close to that of a system with a very large cache made of (expensive) fast SRAM. We demonstrate that ISDRAM cache, containing SRAM buffers that are only 1/64th the size of DRAM, performs almost as well as a conventional SRAM cache of the same size. “Back of the envelope” estimates show that the ISDRAM bit-density should be approximately 1/2 the bit-density of commercial DRAM parts and at least 5x denser than a conventional on-chip SRAM cache of the same size. 1.0 The Case for High Performance Intermediate-Level Memory System The CPU-memory speed gap poses a considerable performance barrier. To illustrate the problem, 9 SPEC benchmarks were selected. SimpleScalar processor [2] with a 256KB L2 cache (a baseline configuration) running at 2Ghz, 4GHz and 8GHz was simulated executing these benchmarks. A main-memory model with delays independent of the processor clock frequency was used. All other delays (including L1 and L2 cache delays) were modeled in numbers of clock cycles. Simulation details are given in section 4. For each benchmark, the time (#_clocks x clock_period) it took the processor to execute a fixed number of instructions using different clock frequencies, was used to estimate the fraction of time the processor is idle waiting on memory requests. The computation is as follows: We assume that the delay is a linear function of the clock period, that is, the time it takes to execute N instructions T=A*t + B, where t is the clock period. The constant A capture the system’s frequency dependent component and B is the clock independent component, namely B=Idl-time. This is only an approximation, since a superscalar processor is a complex dynamic system that is sensitive to relative component delays. However, checking the model accuracy by calculating the discrepancy between the linear model and simulated execution times show the error to be small (~1%). If T1 is the time it takes to execute N instructions at one frequency, and

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application of Shape Memory Alloys in Seismic Isolation: A Review

In the last two decades, there has been an increasing interest in structural engineering control methods. Shape memory alloys and seismic isolation systems are examples of passive control systems that use of any one alone, effectively improve the seismic performance of the structure. Characteristics such as large strain range without any residual deformation, high damping capacity, excellent re...

متن کامل

Parametric Energy Analysis in Early Stage Design: Case of High-rise Office Buildings in Tehran

Nowadays, the high rates of energy consumption of buildings in contemporary societies, as well as in global problematics such as climate change, depletion of natural resources and environmental pollution, is widely recognized. In this context, energy efficiency of buildings has become a crucial aspect of national and international energy policies. Nevertheless, the design of high performance bu...

متن کامل

Parametric Energy Analysis in Early Stage Design: Case of High-rise Office Buildings in Tehran

Nowadays, the high rates of energy consumption of buildings in contemporary societies, as well as in global problematics such as climate change, depletion of natural resources and environmental pollution, is widely recognized. In this context, energy efficiency of buildings has become a crucial aspect of national and international energy policies. Nevertheless, the design of high performance bu...

متن کامل

How Effectiveness Of Comprehensive Performance Measurement Systems on Manager's Performance Through Modification of Mental Models (Learning Process)

One of the ways to reduce agency costs is to plan for the creation of effective decision-making information by designing appropriate comprehensive performance evaluation systems according to managers' learning process One of the important factors in the processing and classification of information for cognitive learning is mental models that are categorized in two dimensions of mental model co...

متن کامل

Investigating the role of high-rise building shell elements in reducing energy consumption (case example: Isfahan Cascade doctors' residential towers)

Significant energy is used to provide and maintain environmental conditions for thermal comfort in high-rise residential buildings, which largely depends on the components of the shell design. Therefore, to achieve high amounts of energy savings in buildings, high-impact design measures must first be defined and then optimized. This study seeks to answer the question of how the components of th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002